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ABSTRACT 

Reducing the scatter between cluster mass and optical richness is a key goal for cluster cosmology from 
photometric catalogs. We cons ider various modifications to the red-sequence matched filter richness 
estimator of lRozo et al.1 (|2009bf ) , and evaluate their impact on the scatter in X-ray luminosity at fixed 
richness. Most significantly, we find that deeper luminosity cuts can reduce the recovered scatter, 
finding that <7\ n L x \x = 0.63 ± 0.02 for clusters with M^ooc ^ 1-6 x 10 14 hj^ M©. The corresponding 
scatter in mass at fixed richness is <J\ n M\\ ~ 0.2 — 0.3 depending on the richness, comparable to that 
for total X-ray luminosity. We find that including blue galaxies in the richness estimate increases 
the scatter, as does weighting galaxies by their optical luminosity. We further demonstrate that our 
richness estimator is very robust. Specifically, the filter employed when estimating richness can be 
calibrated directly from the data, without requiring a-priori calibrations of the red-sequence. We also 
demonstrate that the recovered richness is robust to up to 50% uncertainties in the galaxy background, 
as well as to the choice of photometric filter employed, so long as the filters span the 4000 A break of 
red-sequence galaxies. Consequently, our richness estimator can be used to compare richness estimates 
of different clusters, even if they do not share the same photometric data. Appendix [A] includes 
"easy-bake" instructions for implementing our optimal richness estimator, and we are releasing an 
implementation of the code that works with SDSS data, as well as an augmented maxBCG catalog 
with the A richness measured for each cluster. 
Subject headings: galaxies: clusters - X-rays: galaxies: clusters 



1. INTRODUCTION 

In the next few years, a host of large scale optical 
surveys — e.g. the Dark Energy Survey (DES 12 ), the 
Panoramic Survey Telescope & Rapid Response Sys- 
tems (Pan-ST ARRS 13 ), Hypcr-Suprime Camera (HSC; 
Takada 2010), and the Large Synoptic Survey Telescope 
(LSST 14 ) — are expected to generate galaxy catalogs 
spanning several thousands of square degrees to suffi- 
cient depth to reliable detect galaxies at redshifts as 
high as z « 1. These surveys will be used to optically 
select galaxy clusters, and in conjunction with stacked 
weak-lensing mass calibration, can be used to place tight 
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constr ai nts on cosmological par ameters (e.g. iRozo et all 
I2010allbl : [Qeuri fc Takadall20ll . 

One of the difficulties confronting cosmology with op- 
tical clusters is the fact that optical richness estimates 
are expected to be noisy tracers of the underlying halo 
mass. 15 This is particularly problematic because the sen- 
sitivity of cluster abundance studies is sensitive to the 
uncertainty in the scatter of the mass-observable rela- 
tio n, and this sensiti vity increases with increasing scat- 
ter ([Lima fc Hu 2003). In addition, high scatter increases 
the sensitivity of cluster abundance measurements to 
non- Gaussian fluctuat ions in the observable-mass rela- 
tion (Sh aw et al.ll2010t ). which are often degenerate with 
cosmological parameters. Consequently, in order to min- 
imize the dilution of the cosmological information of op- 
tically selected cluster samples, a richness estimator that 
minimizes the scatter in the richness-mass relation is 
highly desirable. 

As an example of the magnitude of this problem, we 
consider the scatter in m ass at fixed r i chness for the 
maxBCG cluster catalog (jKoester et al.l r2~007al ). which 
is currently the best-studied optica lly selected cluster 
catalog at mode r ate redshifts (e.g. iBecker et al.l [20071: 
Rozo et all 120071: ISheldon et al.l l2009at Uohnston et al 



.20071: iRvkoff et alJ2008tlHansen et all2009f ). IRozo et aL 
(|2009alT finds that the scatter in mass at fixed richness 
(A200) f° r m axBCG clusters is <J\nM\N = 0.45 ±0.1 
for clusters with M 200 > 10 14 /i _1 M©. For compari- 
son, X-ray luminosity, which is the noisiest X-ray mass 



15 As discussed in Rozo ct al. (2009b), throughout this work the 
word "richness" is meant to be understood as "optical mass tracer," 
and is not necessarily the actual number of cluster galaxies within 
the virialized region of a cluster or the total optical luminosity of 
the cluster. 
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estimator, has a scatter of a\ n M\L x = 0.25 — 0.32 
(|Vikhlinin et all 120091: iMantz et all |201Q[ ). 16 . This is 
comparable to the scatter in halo mass at fixed weak lens- 
ing mass, which is also estimated to be about o~\ n m\WL = 
0.25 - 0.30 (|Becker fc Kravtsovl[2010f) . Clearly, there is 
room for improvement for optical mass tracers. 

Indeed, it has been argued on the basis of nu- 
merical simulations that the intrinsic scatter of the 
richn e ss-mass relation is Poisson (e.g. Kravtsov et al.l 
l200i IBerlind et all [20031: IZheng et all |2005J). While 



recent work indicates the scatter may be significantly 
super-Poisson at the cluster scale (iBovlan-Kolchin et all 
l2010t IWetzel fc Whitdl2010t IBusha et al.ll2010D . even in 
this case the intrinsic scatter is expected to be closer 
to (J\ n M\N = 0.20 — 0.25 rather than 0.45 at A/ 2 oo ~ 
2 x 10 14 Mq, so it is apparent that the maxBCG richness 
estimator is dominated by extrinsic sources of scatter. 

This is the third in a series of papers whose goal is to 
develop improved richness estimators that arc both qual- 
itatively and quantitatively understood in detail. The 
fir st of these papers , henceforth referred to as Paper 
I (|Rozo et al.l I2009TJ ) . laid the fundamental framework 
of our new optical richness estimator and quantitative 
techniques. There, we demonstrated that by relying on 
a probabilistic approach toward red sequence color se- 
lection, combined with an aperture optimization, one 
achieves much more robust richness estimates. 

In this pap er and Paper I (see also lPopesso et al1l2004t 
Hopes et al.fl2006[ ) we use X-ray luminosity Lx, as our 
mass proxy. Our chosen figure-of-mcrit is the scatter 
in Lx at fixed richness, <j\ n l x \n, where N is an ar- 
bitrary richness. The most important reason why we 
choose this metric is that it is easily available for our large 
cluster cata l og via the ROSAT All-Sky Survey (RASS: 
IVoges et al.1 Il999h . That said, there are strong physi- 
cal motivations for relying on X-ray luminosities for this 
study. Specifically, not only is the scatter in mass at fixed 
Lx smaller than the scatter in mass at fixed richness 
(°M|Lx ~ 0.25 compared to <J M \ N200 « 0.45), the corre- 
lation coefficient between Lx and mass at fixed r ichness 
is very nearly unity (r > 0.9: iRozo et al.ll2009"aT ). That 
is, at fixed A200, clusters that are brighter in X-rays are 
also more massive. Consequently, we feel confident that 
reducing the scatter in the Lx _ richness relation will also 
reduce the scatter in the mass-richness relation. Our fi- 
nal richness estimator from Paper I was easily superior 
to that of the maxBCG cluster catalog, with a scatter 
in Lx at fixed richness of <J\ n L x \N = 0.69 ± 0.02, com- 
pared to <Ti n L x |jv = 0.86 ±0.02 for the maxBCG richness 
esti mator. 

In IRozo et "ail ([201 lL henceforth Paper II) , we inves- 
tigated how extrinsic sources of scatter can impact the 
observed scatter in the richness-mass relation. In that 
work, we demonstrated that while optical richness is in 
principle subject to many sources of noise, in practice 
only a very small subset of these is observationally rel- 
evant. For instance, both photometric errors in galaxy 
magnitudes/colors and photometric redshift uncertain- 



16 IVikhlinin eTaLl lf2009T ) quote a llLLx \ M = 0.396, and L x oc 
M 161 , which corresponds to a scatter in mass at fixed Lx of 
°ln M\Lx = 0.396/1.61 = 0.25. The same calculation using the 
results of lMantz et al.l 1120101 ') gives cr lnM | Lx = 0.32. 



ties in cluster rcdshifts arc unimportant with SDSS- 
quality data or better. In Paper II we demonstrate that 
there are two dominant sources of noise. The first, which 
is an issue for all photometric cluster catalogs, is the den- 
sity of background galaxies within which a cluster is em- 
bedded. Because of galaxy clustering, this background 
exhibits large cluster-to-cluster fluctuations, so a small 
percentage of galaxy clusters end up embedded in very 
large galaxy overdensities. Such occurrences inevitably 
result in gross richness over-estimates, i.e. projections 
onto correlated structures. The second is failing to iden- 
tify the correct center of the galaxy clusters. This effect 
leads to significant underestimation of cluster richness if 
the centering offset is comparable to the aperture used to 
estimate richness, and can be mitigated with improved 
optical centering algorithms. 

In this work, which we refer to as Paper III, we inves- 
tigate which parameters from Paper I may be changed 
to further improve the fidelity of our optical richness es- 
timator. By monitoring the change in scatter, <J\ n L x \N, 
we can directly quantify which parameters significantly 
improve the richness estimator. The specific modifica- 
tions we consider are whether cluster richness can be 
improved by counting blue cluster galaxies in addition 
to the red-sequence galaxies; by summing red-sequence 
optical luminosity rather than galaxy counts; and the im- 
pact of measuring galaxies further down the luminosity 
function. 

In addition to exploring these various modifications, 
we also test the robustness of our richness estimator to 
various perturbations, similar to the tests made on sim- 
ulated data in Paper II. This includes a measurement of 
the bias and scatter of the richness when different optical 
filters are used to isolate the red sequence. Finally, we 
demonstrate the origin of the optimal radial and lumi- 
nosity cuts using the methods of Paper II. 

The end result of this investigation is a new richness 
estimator that is both robust and, we believe, very close 
to optimal. Importantly, because this is a stand-alone 
richness estimator, our method can be applied to any 
cluster catalog, and can therefore improve optically se- 
lected cluster catalogs regardless of how the initial clus- 
ter selection is done. Moreover, the insights that we have 
gained while performing this work are now informing a 
new cluster finding algorithm that revolves around the 
probabilistic framework of our richness estimator. A de- 
tailed comparison of our richness estimator to other es- 
timators from the literature will be presented in a future 
paper. 

The paper is set up as follows: in Section [2] we intro- 
duce the data sets upon which our analysis is based. Sec- 
tion [3] briefly reviews the richness estimator from Paper I 
and sets up the probabilistic framework employed in this 
paper. Section [4] describes how we define our figure-of- 
merit for assessing improvement in optical richness esti- 
mation, as well as the various modifications we consider. 
Section [5] tests the robustness of our richness estimator 
to various sources of systematic errors, most notably the 
choice of filters used to select red-sequence galaxies, as 
well as the exact values of the model parameters that 
define the filter employed in our richness estimates. Sec- 
tion [6] follows the work of Paper II to show the origin of 
the optimal radial and luminosity cuts. Our conclusions 
are presented in Section [7] We have also summarized 
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all the relevant information required to code our new 
richness estimator in Appendix [A] Finally, Appendix [B] 
provides a tentative mass-richness relation for our opti- 
mal estimator. Wc emphasize, however, that the problem 
of deriving a robust calibration, with well characterized 
uncertainties, is left for future work. 

2. DATA 

All data used in this work comes from two large 
area surveys, t he Sloan Digital Sky Survey (SDSS: 
York et al.l20"o0h and the ROSAT All-Sky Survey (RASS: 
Voges et alJUoOflV SDSS imaging data is used to select 
clusters and to measure their matched filter richness, 
while RASS data provides 0.1-2.4 keV X-ray fluxes for 
each cluster. 

The input data and analysis in this work is similar to 
Paper I, although there are some key differences noted 
below. Of particular note is the change from an input 
galaxy catalog based on SDSS DR4 to one based on SDSS 
DR7. Here we summarize the key aspects of the analysis. 
For full details see Paper I. 

2.1. Cluster Sample 

Following Paper I, cluster locations, redshift esti- 
mates, and initial richnes s estimates are taken from the 
maxBCG cluster catalog (jKoester et al.ll2007al f0). an op- 
tically selected cluster catalog. The maxBCG algorithm 
identifies galaxy clusters by relying on the observation 
that the galaxy population of massive halos clusters 
tightly in space and color, forming wh at is known as 
the E/SO ridgeline or red-sequence (e.g. lDressleH [l98l 
iKormendv fe Djorgovskll98alHansen et al.ll2009j ). This 
feature allows for high-contrast detection of galaxy clus- 
ters with optical data, both locally and out to high red - 
shift(e.g. iGladders fe Yeel 120001 : lEisenhardt et alJl2008T ). 

The maxBCG catalog is approximately volume limited 
in the redshift range of interest (0.1 < z < 0.3), with 
very accurate cluster photometric redshifts (Sz ~ 0.01). 
Studies with mock SDSS catalogs i ndicate that the com- 
pleteness and pur ity are above 90% (|Koester et al.|[2007at 
iRozo et aLll2007f ). The maxBCG catalog has been used 
to investigate the scaling of multiple cluster mass prox- 
ies w ith richness, includ ing line-of-sight ve locity disper- 
sion (iBecker et al.l l2007f). X-ray l uminosity (iRvkoff et al.l 
I2008T) . and weak lensing shear ([Sheldon et all l2009al ). 
as well as derive cosmo l ogical constraints from cluster 
counts (|Rozo et alJl2007ll2010ah . 

The richness estimator used in the maxBCG catalog 
is N200, defined as the number of galaxies with g — r 
colors within 2a of the E/S0 ridgeline as defined by the 
BCG color, that are brighter than 0.4 (in i-band), 

and fo und within a scaled aperture j-^qq of the cluster 
center ([Hansen et al.ll2005l ). The full catalo g comprises 
13, 823 objects with a richness threshold -/V200 > 10, cor- 
respo nding to M > 5 • 10 13 h- 1 M Q ([Johnston et all 
l2007h . 

2.2. X-ray Measurements 

The scatter in Lx at fixed richness, <J\ n L x \N (where 
N is an arbitrary r ichness) is estimate d using the meth- 
ods of Paper I and IRvkoff eTail ([20081 ). In brief, we use 
the RASS photon maps to estimate the 0.5-2.0 keV X- 
ray flux at the location of each cluster, which is turn 



used to derive Lx [0.1-2.4 keV] given the cluster's pho- 
tometric red shift (the conversion f actors are similar to 
those used in lBohringer et aT1l2004l ). We then perform a 
Bayesian linear regression to ItiLx as a function of In N, 
where N is the richness parameter to be tested. The vari- 
ance in hiLx is included as a free parame ter. The fit i s 
done following the algorithm presented in iKellvl (|2007l) , 
and correctly takes into account errors on the indepen- 
dent variable as well as upper limits on Lx for those 
clusters without significant detection of X-ray emission. 
This method has several advantages, in that it takes into 
account all the available X-ray data, not only that for 
clusters in X-ray catalogs. 

When estimating Lx, one must specify an aperture. 
The matched filter richness estimators described in this 
paper have the benefit of assigning a cluster radius R Cl 
to each individual cluster. As in Paper I, we estimate 
Lx using the aperture derived from the cluster richness. 
Alternatively, using a fixed 0.9 h~ x Mpc aperture to esti- 
mate Lx does not have a significant eff ect on our results. 

As discussed in IRvkoff et al.l (|2008l see Section 5.6), 
there is clear evidence that strong cool core clusters 
increase the scatter in X-ray cluster properties. High 
resolution X-ray imaging of clusters allows the exclu- 
sion of cluster cores, r educing the scatter in observed 
X-ray properties (e.g . lO'Hara et al.l 120061 : IChen et all 
I2007t iMaughar] 120071 ). Unfortunately, the broad PSF 
of ROSAT means that it is impossible to do so in this 
work. In Paper I, wc analyzed both the full sample of 
maxBCG clusters as well "clean" sample after re- 
moving all 10 known strong cool core maxBCG clusters 
that may significantly bias our results (see Section 2.4 in 
Paper I) . We concluded that although the absolute value 
of the scatter in Lx at fixed richness is reduced by using 
the "clean" sample, the same general trends were evident 
with the full and "clean" samples. In this work, we focus 
exclusively on the "clean" sample of Paper I. 

2.3. Input Galaxy Catalog 

The input galaxy catalog for this work i s derived from 
SDSS DR7 data (jAbazajian et al.l 120091 ). This data 
release includes nearly 10000 square degrees of drift- 
scan imaging in the Northern Galactic Cap. How- 
ever, as the maxB CG cluster catalog was created us- 
ing data from DR4 ([Adelman-McCarthv et al.ll2006l ). the 
relevant area covered is ~ 7500 square degrees. Sur- 
vey edges, regions of poor seeing, and bright stars are 
masked as pr eviousl y described dScranton et al.l 120021 : 
iKoester et atll2007at ISheldon et al. 2009b). In this work 
we use CM0DEL_CDUNTS as our total magnitude, and 
M0DEL_C0UNTS when computing colors. All magnitudes 
are corrected for Galactic extinction. 

The careful selection of a clean input catalog is required 
for proper richness estimation. In Section [5"Hl we discuss 
the effects of "catalog noise" on the richness-mass re- 
lation, by which we mean the inclusion of stars and/or 
artifacts as well as catastrophic photometric errors in 
the galaxy catalog employed when estimating richnesses. 
Our be st input catal o g was based on the same cuts 
used in ISheldon et al.l (|2009b[ ). After selecting galaxies 
based on the default SDSS star/galaxy separator, we fil- 
ter all objects with any of the following flags set in the 
g,r, or i bands: SATURATED, SATUFLCENTER , BRIGHT, 
N0PETR0, DEBLENDED_AS_M0VING. These cuts remove ~ 
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30% of the objects brighter than i > 22, including a 
significant number of relatively bright stars that are er- 
roneously tagged as galaxies in the SDSS pipeline. 

3. THE MATCHED FILTER RICHNESS A 

Working within the same theoretical framework as Pa- 
per I, we now attempt to improve upon the richness esti- 
mator A advocated in that paper. Consequently, we now 
review the richness estimator A as described in Paper I. 

Let x be a vector describing the observable properties 
of a galaxy (e.g. galaxy color, magnitude, and position). 
We model the projected galaxy distribution around clus- 
ters as a sum 5(x) = Au(x|A) + 6(x) where A is the 
number of cluster galaxies, u(x|A) is the cluster's galaxy 
density profile normalized to unity, and 6(x) is density of 
background (i.e. non-member) galaxies. The probability 
that a galaxy found near a cluster is actually a cluster 
member is simply 



p(x) 



Am(x|A) 



Au(x|A) + 6(x) ' 



(1) 



The total number of cluster galaxies A must satisfy the 
constraint equation 



A 



p(x|A) 



E 



Au(x|A) 



R<RaW 



Au(x|A) + 6(x) 



(2) 



The corresponding statistical uncertainty in A is given by 
(see Paper II) 



Var(A) = 5>(x|A) [l-p(x|A)] 



(3) 



In principle, these sums should extend over all galaxies. 
In practice, one needs to add over all galaxies within some 
cutoff radius R c and above some luminosity cut L cut . In 
Paper I, L cut was set to L cut = 0.4L*, while the radial 
cut was assume to scale as a power-law with A, such that 

iZc(A) =i? (A/100.0)' 3 . (4) 

The most important thing to note about Eqns. [2] and 0] 
is that the cluster richness is the only unknown. Con- 
sequently, one can numerically solve Eqn. [2] for A using 
simple zero-finding algorithm. This automatically pro- 
duces a cluster radius estimate R c via equation 01 

In Paper I we considered three observable properties of 
galaxies: R, the projected distance from the cluster cen- 
ter; m, the galaxy i-band magnitude; and c, the galaxy 
g — r color. We adopted a separable filter function 



u(x) = [2irRE(R)](f>(m)G(c), 



(5) 



where is the two dimensional cluster galaxy density 
profile, 4>(m) is the cluster luminosity function (expressed 
in apparent magnitudes), and 67(c) is color distribution 
of cluster galaxies. The prefactor 2wR in front of £(i?) 
accounts for the fact that given S(i?), the radial proba- 
bility density distribution is given by 2nRH(R). In Paper 
I, we showed that the color filter is by far the most im- 
portant of the three filters in reducing the scatter. We 
summarize below the specific filters employed in Paper I. 

3.1. The Radial Filter 

Fo r the radial fil t er, P aper I adopted an NFW pro- 
file n avarro et all 11995(1 . which is a good descrip- 
tion of the dark matter profile in N-body simulations, 



and is found to be a good descriptor of the distribu- 
tion of cluster galaxies dLin &: Mohrll2004l : lHansen et al.l 
120051: iPopesso etHI 120071) . The c orresponding two- 
dimensional surface density profile is (lBartelmannlll996T) 



T,(R) oc 



1 



-f(R/Rs), 



{R/R s f-r 

where i? s is the characteristic scale radius, and 



m = i - 



i 



: tan 



1 



1 



(6) 



(7) 



This formula assumes x > 1. For x < 1, one uses the 
identity tan ~ 1 (ix) = itanh ( x). 

Following iKoester etaLl <|2007af ). Paper I set R s = 
0.15 hr 1 Mpc. Also, in order to avoid the singularity at 
R = 0, they assumed £ was constant for R < R core = 
0.1 Mpc. This core density is chosen so that the mass 
distribution S(i?) is continuous. Finally, the profile S(J?) 
is truncated at the cluster radius i? c (A), and is normal- 
ized such that 

r-Re(A) 

1 = c / dR 2irRE(R). (8) 



For the NFW profile with the given values of R s and R c . 
the normalization factor c can be parametrized as 

c=exp(1.6517 - 0.5479/j + 0.1382p 2 - 0.0719/r 3 - 

0.01582p 4 - 0.00085499p 5 ), (9) 

where p = ln{R c ) and 0.001 < R c < 3. 

3.2. The Luminosity Filter 

The luminosity distribution of maxBCG clusters is well 
repre sented by a Schechter function (e.g. lHansen et al.l 
2009J) which we write as 

<f>(m) cx io-°- 4 ( TO -™«)(«+i) ex p ^_Kn - 4 ( m -" 1 -)) . (10) 

Paper I set a = 0.8 independent of redshift. The charac- 
teristic magnitude, to*, is calculated for a k-corrected 
passiv ely evolving stellar population l|Koester et al.l 
I2007bh . We assume Ml = —21.22 for red galaxies, corre- 
sponding to 2.25 x 10 10 Lq. A PE GASE.2 stellar Pppula- 
tion/galaxy formation model fe.g. lEisenstein et al.ll200ll) 
was used to calculate the fc-corrected magnitude at each 
redshift. In the redshift range 0.05 < z < 0.35 appropri- 
ate for maxBCG, m*(z) is well approximated by a fourth 
order polynomial: 

m*(z) = 12. 27 + 62. 36z-289.79z 2 + 729.69z 3 -709.42z 4 . 

(11) 

For each cluster, to* is taken at the appropriate red- 
shift and the filter is normalized by integrating down to 
the magnitude cutoff. In Paper I, this was chosen to be 
0.4i*, or to* + 1 mag. 

3.3. The Color Filter 

The old stellar populations in the red sequence galax- 
ies have a prominent 4000 A break in their spectra. In 
the redshift range targeted by maxBCG, z < 0.35, the 
4000 A break is located in the g band. Therefore, the 
g — r color of red-sequence galaxies correlates strongly 
with redshift, and results in tight E/S0 ridgelines. Con- 
sequently, Paper I relied on c = g — r for their color 
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filter. They assume G(c) is Gaussian with a small in- 
trinsic dispersion of <7i n t = 0.05 mag. The corresponding 
color filter, G(c) is 

(c-(c\z)) 2 ' 



G(c\z) 



1 



2ira 



exp 



2a 2 



(12) 



where c = g — r is the color of interest, (c\z) is the mean of 
the Gaussian color distribution of early type galaxies at 
redshift z, and the net dispersion a is the sum in quadra- 
ture of the intrinsic color dispersion <7i nt = 0.05 and the 
estimated color error a c . The mean color (c\z) = 0.625 + 
3.149z was determined by matching maxBCG cl uster 
member s to the SPSS LRG (lEisenstein et al.ll200ll ) and 
MAIN ([Strauss et al.l |2002[ ) spectroscopic galaxy sam- 
ples. In Section 14.31 we investigate modifications of this 
color model based on measureme nts of the red seq uence 
of maxBCG clusters measured in lHao et al.l (|2009t ). 

3.4. Background Estimation 

The last necessary ingredient for estimating A is a back- 
ground model. We assume the background galaxy den- 
sity is constant in space, so that 6(x) = 2nRY, g {mi,c) 
where E 9 (m, , c) is the galaxy density as a function of 
galaxy i— band magnitude and g — r color. The mean 
galaxy density is obtained by binning the full galaxy cat- 
alog in color a nd magnitude using a cloud- in-cells (CIC) 
algorithm fe.g. lHocknev &: Eastwoodlll981[ ) and dividing 
by the survey area. For our cells, we use 40 evenly-spaced 
bins in g-r € [-1,3] and 100 bins in i e [12,22]. The fi- 
nal galaxy number density is normalized by the width of 
each color and magnitude bin (0.1 mags each). We mark 
as "bad" all cells that have fewer than 5 galaxies/sq. deg. 
This has the effect of masking out erroneous photometric 
artifacts that are called bright galaxies in the DR7 cat- 
alog. Although these artifacts are rare, they may never- 
theless significantly bias the luminosity- weighted richness 
for a few clusters (sec Section |4~6]) . 17 We emphasize that 
because the background is measured per square degree, 
the average number of background galaxies is automat- 
ically accounted for as the angular size of the clusters 
changes with redshift. 

4. IMPROVING A 

We now explore whether we can further improve upon 
the results of Paper I in a variety of ways. We define our 
metric to assess improvement in a richness estimator in 
Section l4~Tl Section l4~2l tests the impact of modifying the 
color filter to account for the blue galaxy population of 
cluster galaxies. In Section [4731 we take into account the 
small but non-zero tilt in the ridgeline. Section 14.41 ex- 
plores the effect of the luminosity cut on the Lx-richncss 
relation, and Section ^. 5l focuses on the importance of the 
choice of the radial filter function. Finally, in Section B~6l 
we investigate whether red sequence luminosity is a bet- 
ter Lx tracer than A. Throughout this discussion, we 
have adopted a fixed metric aperture R c = 0.9 /i" 1 Mpc 
for estimating richness, which we demonstrated in Pa- 
per I was near optimal for the cluster sample under con- 
sideration. In section 14.71 we relax this assumption and 
optimize our choice of aperture. 

17 We note that Paper I relied on random point sampling to 
estimate S g rather than making use of the full survey area. This 
does not impact our results in any way. 



4.1. Richness Testing Methodology: What Constitutes 
an Improved Richness Estimate? 

The primary goal of this paper is to improve upon the 
Paper I richness estimator. To do that, however, we must 
first define the metric used to gauge improvement. As 
discussed in the Introduction, our chosen figure-of-merit 
is the scatter in Lx at fixed richness. As in Paper I, we 
limit ourselves to the richest 2000 clusters as ranked by 
the richness estimate under consideration so that our re- 
sults are insensitive to the A^oo > 10 cut in the maxBCG 
catalog. For the original maxBCG richness estimator 
A200, this is equivalent to A200 ^ 20, or an equivalen t 
mass of M 200 > 1 x 10 14 /i" 1 Mpc (| Johnston et al.ll2007t) . 
In Paper I we confirmed that our results with the top 
1000 or 3000 clusters are consistent with the top 2000. 

We denote the original matched filter richness estima- 
tor described in Paper I as Ao- Using the top 2000 rich- 
est clusters in the clean sample, and a fixed 0.9 Mpc 
aperture, we find cr lnix | Ao = 0.69 ±0.02. 18 As discussed 
in Section 5.3 of Paper I, comparing two richnesses is 
complicated by the fact that the errors are correlated. In 
all pairwise richness comparisons, we perform bootstrap 
resampling on the clean cluster catalog and calculate the 
scatter in the top 2000 clusters for both Ao and the new 
richness A nGW - For each bootstrap resampling we cal- 
culate r = o'inL x |A ncw / (7 inLY|Ao- ^ the improvement in 
scatter is not significant, we will find that r is consistent 
with unity, whereas an improved (worse) scatter will re- 
sult in an r value that is significantly less than (greater 
than) 1. This is our primary diagnostic for improvement. 

In addition to the scatter, we also monitor the red- 
shift evolution in the I^-richness relation of each of the 
richness estimators we consider to ensure that no strong 
redshift evolution is introduced by our alterations. We 
measure the evolution using a stack i ng an alysis as de- 
scribed in Paper I and iRvkoff et al.1 (|2008l see Section 
5.3). In brief, we measure (Lx\N) where N is the rich- 
ness measure of interest in three different redshift bins 
(0.10 < z < 18; 0.18 < z < 0.26 ; and 0.26 < z < 0.30). 
As shown in IRvkoff et al.l (|2008l ) , the stacking analysis 
allows us to go much further down the richness function 
than for the scatter analysis. While this introduces selec- 
tion function effects near the A200 > 10 threshold of the 
maxBCG cluster catalog, we expect these to be minor 
in the redshift evolution, which, as mentioned earlier, we 
only use as a sanity check. We fit the stacked data with 
a power-law evolution in redshift, 



(L X \N)=A 



1 + z 
1 + z 



(13) 



where z is the median redshift of the cluster sample and 
N is the richness measure of interest. We find that 7 = 
0.7 ± 0.8 for Ao, consistent with no evolution. 

As noted in Paper I, even if the relation between N and 
cluster mass is redshift independent, we may expect to 
observe evolution in the L \ — N relation due to evolution 
in the Lx — M relation. The expectation for self-similar 
evolution in Lx at fixed mass i s Lx oc p c for soft-band 
X-ray luminosities (|Kaiserlll986l) , where p c is the critical 
density of the Universe at redshift z. In a ACDM Uni- 
verse with tt m = 0.25, Lx oc p c is well approximated by 

18 We note that we get c onsi stent values with DR4 and DR7 
photometry; see also Section 15.61 
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L x oc (1 + z) 110 , or 7 = 1.10. Thus, for A , the evo- 
lution is consistent with both the no-evolution and the 
self-similar evolution models. As we do not know what 
the true evolution should be, we do not use the evolu- 
tion parameter 7 as a true comparison between richness 
estimators. That said, modest evolution in the richness- 
mass relation is a desirable property for richness estima- 
tors, so we do check that 7 remains in the range w — 1.5 
as we modify A. 

4.2. Blue Cluster Members 

The red sequence is well-suited to cluster finding due to 
its high contrast with the background, and its strong red- 
shift dependence (e.g . lBower et aLlll992tlGladders fc Yed 
I200d: iKoester et alTl2007a|) . However, not all cluster 
galaxies are red, only the majority, with typical red - 
fractions being of order « 80% (e.g. lHao et alj [2009h . 
We now explore whether accounting for this blue galaxy 
population in our color filter improves our richness esti- 
mator. 

We first empirically construct a new color filter that 
accounts for both red and blue galaxies. As an input, we 
use the 2000 richest clusters as measured by our origi- 
nal matched filter richness estimator, Ao- We then bin 
these clusters in 10 redshift bins of width 0.02, and se- 
lect all galaxies within 0.9 ft. -1 Mpc of the BCG brighter 
than 0.4L*. To estimate galaxy luminosity, we assume 
all galaxies are at the cluster redshift. The empirical 
color distribution of these galaxies is then background 
subtracted and fit as a sum of two Gaussians using the 
Error Corrected Gaussian Mixture Model (ECGMM) of 
lHao et all (|2009t ). which allows us to properly take into 
account photometric errors. The location, width, and 
relative amplitude of these two Gaussians define the 
appropriate color filter for the color distribution of all 
(red+blue) cluster galaxies within our chosen aperture. 
Additional tests have shown that the color distribution 
does not depend significantly on cluster richness Ao. 

Figure Q] shows our best fit model (solid green line) for 
the lowest redshift bin (0.1 < z < 0.12) and the highest 
redshift bin (0.28 < z < 0.30). Note our best fit model 
is narrower than the binned histogram since the latter 
is broadened by photometric errors, particularly in the 
case of our high redshift bin. 

Our empirically determined model filter takes the form 



G(c\z) = 



"red 



2lT(J r 



— exp 

id 

«bluc 



(c - ^red) 5 



2a 



■ exp 



(c - A^bluo) 5 



(14) 

where a ro( j and «biuo are the relative weights of the red 
and blue Gaussians; fi red and /ibiue are the mean color 
of the red and blue galaxies at redshift z; and a T ed,mt 
and <7biue,mt are the intrinsic scatter for the red and blue 
galaxies. Each of these parameters is fit as a function of 
redshift using a simple linear relation, with the exception 
of the Gaussian widths, which we find is consistent with 
no evolution. Our final model parameters are 

Mrcd =0.629 + 3. 016z 

/ibiuc = 0.590 + 2.115z 

o-red = 0.786 - 0.303z 

"blue = 1-0 — a Icd 



(15) 
(16) 
(17) 
(18) 




Fig. 1. — Color histograms of maxBCG clusters in the low- 
est redshift bin (top panel) and the highest redshift bin (bottom 
panel). The black histogram comprises the total of cluster mem- 
bers brighter than 0.4L* within 0.9 h~ 1 Mpc, after subtracting out 
the predicted number of background galaxies. The green curve is 
our best fit double-Gaussian model, taking into account the "obser- 
vational broadening" brought about by photometric errors. This 
explains why our best fit model is narrower than histogram, par- 
ticularly for our high redshift bin. 



Trcd.int = 0.05 (19) 

Oblue.int = 0.25. (20) 

We can now replace the Gaussian color filter in Eqn. [T^] 
with this new filter, and combine it with the original 
luminosity and radial filters to measure a new rich- 
ness, Ared+biuc- The scatter in Lx at fixed richness is 
o"inL x |A rod+b i uc = 0-72 ± 0.02, and the corresponding red- 
shift evolution parameter is 7 = 0.1 ±0.6. We note 
that the width of the color filter for the blue cluster 
members derived here is slightly wider than t hat derived 
from spectroscopically confirmed subsamples (jHao et al.1 
l2009f ). implying that there is some contamination from 
background galaxies. However, further tests have con- 
firmed that our results are insensitive to the precise width 
of this component (see also Section 15.31) . 

To measure the significance of the change, we use the 
bootstrap resampling described in Section 14.11 and find 
r = 1.04 ± 0.02. That is, including the blue galaxies in 
the filter increases the scatter at a significance of 2a. 
This may be due to the fact that measuring blue cluster 
members is inherently noisier due to the smaller contrast 
with the background, or it may reflect that blue galax- 
ies in clusters tend to have fallen in more recently (e.g. 
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lAbraham et al.l Il996f ). thereby adding stochasticity to 
the richness measure. Adequately addressing this issue 
would require us to repeat this analysis using full spec- 
troscopic membership information, which we do not cur- 
rently have available for this sample. We also note the 
possible correlation of the radial and luminosity filters 
with the color filter, as blue cluster galaxies are gener- 
ally faint er and further from the core than the red galax- 
ies (e.g., lHansen et al.ll2009l ). However, as we show in 
Paper I, the color filter is dominant so we do not expect 
this to be a significant issue. We conclude that attempt- 
ing to include blue galaxies in richness estimates when 
only photometric data is available will increase the scat- 
ter of the Lx^richness relation. 

4.3. Red Sequence Tilt 

Thus far, we have treated the color of the red sequence 
cluster members as a function of rcdshift only. How- 
ever, in addition to a zero-point (mean color) and scat- 
ter, the ridgeline has a slope in color-magnitude space: 
fainter (less massive) galaxies have bluer colors, pos- 
sibly re flecting trends from the mass-metallicity rela- 
tion Ce.g.lKodama fc Arimotolll997HBernardi et al.ll2005l : 
iDe Lucia et al.ll2007l ). In this section, we investigate the 
effect of red sequence tilt and its redshift evolution on 
t he matched filter richness estimation. 

lHao et ail (120091 ) used the maxBCG cluster catalog to 
measure the slope and intercept of the g — r vs. i color- 
magnitude relation for red-sequence galaxies. They find 
that the ridgeline slope is nearly independent of cluster 
richness, consistent with findings that the s lope is inde- 
pendent of environment (jHogg et al.ll2004D . The color 
filter that incorporates this tilt is described as follows: 



G(c, m\z) 



2na 



exp 



d{c, m\z) 



where 



d(c, i\z) = c — (mn(i — 17.0) + 617). 



(21) 



(22) 



The slope and zero-point at i 
lHao et al.1 ((20091) : 



17 are taken from 



rnir = -0.0701z- 0.008969 (23) 
6i7 = 3.2982z + 0.58907. (24) 

As before, c = g — r is the relevant color and m denotes 
the i band magnitude. The net dispersion a is taken as 
the sum in quadrature of the intrinsic color dispersion 
Pint = 0.05 and the estimated color error <r c . 19 

After replacing the Gaussian color filter in Eqn. [12] 
with the tilt-based filter in Eqn. 1211 we measure the new 
richness Atiit- The scatter in Lx for the new richness is 
o"i n £ x |A tut = 0.69 ± 0.02, and the corresponding redshift 
evolution parameter 7 = 1.3 ±0.5. The change in scatter 
is insignificant, with r = 0.996 ± 0.01. Therefore, in- 
corporating the tilt of the red sequence does not make a 
significant difference in the scatter or redshift evolution. 

This is not particularly surprising: not only is the tilt of 
the red sequence small compared to the intrinsic scatter, 

19 Note that in principle one could measure distances to the 
perpendicularly to the ridgeline, as opposed to along the color axis 
as we have done. Fortunately, the fact that the tilt of the red- 
sequence is small implies that we do not expect such differences to 
be significant. 



in Paper I we demonstrated that the adopted Gaussian 
color filter was very robust relative to small systematic 
offsets between the center of the filter and the true mean 
color of cluster galaxies. That said, the tilt of the red se- 
quence evolves with redshift, becoming increasingly im- 
portant at higher rcdshifts. Thus, it is possible — even 
likely — that our above conclusion will not hold at high 
redshifts, or when extending A to fainter luminosities, 
thereby increasing the lever-arm over which the tilt of 
the red sequence can act. Therefore, despite the fact 
that we do not observe a significant improvement, we 
have opted to incorporate the tilt-based color filter from 
Eqn.[l2]into our standard definition of A for further tests. 

4.4. Going Deeper 

When estimating richness, one must adopt a luminos- 
ity or magnitude cut. In Paper I, we adopted a luminos- 
ity cut L cut = 0.4L», which was chosen to allow consis- 
tent richness estimates across the entire survey volume 
(0.1 < z < 0.3) while maintaining high precision photom- 
etry for the faintest galaxies considered. However, this 
cut is not uniquely specified by these conditions. The full 
DR7 input catalog is complete to i rj 21.3, which corre- 
sponds to a luminosity of 0.1L* at a redshift of z = 0.3, so 
we can easily go deeper. We now explore whether doing 
so can reduce the scatter in the richness-mass relation. 

We have calculated the matched filter richness A for a 
set of luminosity cuts: L cut /L* = {0.10, 0.15, 0.20, 0.25, 
0.30, 0.35, 0.40} for every maxBCG cluster. 20 For this 
test, we use the color filter including the red-sequence 
tilt, as described in the previous section. 

Figure [2] shows the comparison of the scatter in Lx 
at fixed richness for different luminosity cuts for the 
top 2000 clusters (black diamonds). We find that there 
does in fact appear to be an optimal luminosity cut 
L C ut = 0.2L* (?7i„ + 1.75 mag) below which A fails to im- 
prove any further. We compare the richness for the differ- 
ent luminosity cuts relative to this value using the boot- 
strap resampling method, and find that the proposed cut 
of 0.2L, is significantly (5cr) better than the original lu- 
minosity cutoff of 0.4L*. The resulting scatter value is 
cinLxlA = 0.63 ±0.02, with a corresponding redshift evo- 
lution parameter of 7 = 0.6 ± 0.5, consistent with both 
no evolution and self-similar evolution. 

One question that may arise from looking at Figure [2] 
is whether the flattening in the scatter as a function of 
luminosity cut is driven by the fact that by L cut = 0.1L* 
one begins to approach the limiting magnitude of the 
SDSS galaxy catalog for galaxies at z = 0.3. To test this 
hypothesis, we have run the same scatter analysis on the 
lower redshift half of our cluster sample, i.e., clusters 
with redshift below the median redshift of the sample, 
Zmcd < 0.23. We maintain the same equivalent space- 
density cutoff as the full run (black diamonds) by lim- 
iting the scatter measurement to the top 1000 clusters. 
The resulting points are shown in Figure[2]as red squares, 
and clearly display the same behavior as the full cluster 
catalog. At z = 0.23, a luminosity of 0.2L* (0.1L*) corre- 
sponds to i = 19.9 (20.7), significantly brighter than the 
limiting magnitude of the catalog. Thus, we can only 
conclude that the flattening of the scatter below 0.2L, is 
real, and not due to photometric errors of faint galaxies. 

20 See Section l3.2l for details on the calculation of L„. 
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Fig. 2. — Comparison of the scatter in Lx at fixed richness for 
a set of luminosity cuts. To assess the significance of the improve- 
ment, each richness scatter, cri n l x \\ 1s compared via the bootstrap 
resampling method to the scatter measured with a luminosity cut- 
off of 0.2L,, c r \ n L x \\(0.2)- The black diamonds show the results for 
the full cluster sample, and the red squares for the lower redshift 
clusters with z < 0.23. In both cases the scatter decreases until 
a luminosity cutoff of 0.2L*, and does not improve with deeper 
observations. Note that the comparison of A at 0.2L, to itself is 
identically 1. 



For the time being, we simply adopt this new optimal 
luminosity cut, postponing the discussion of the origin 
of this cut to Section [51 For reference, decreasing our 
luminosity cut from 0.4L* to 0.2L„. increases the cluster 
richness by an average of ss 65%. 

4.5. Radial Filter 

The NFW profile was originally introduced as a good 
fit for t he dark matter dist ribution in N-body simu- 
lations (jNavarro et al.1 119951 ). Although galaxies will 
not necessarily follow the same distribution as that of 
dark matter, studies have shown that the number den- 
sity of cluster galaxies can be d e scribed by an NFW 
function (e.g . iLin fe Mohrl 12004 iPopesso et all 120071 : 
lHansen et atll2005D . Nevertheless, a filter based on an 
NFW profile might not necessarily be ideal. In this sec- 
tion we investigate the effect of changing the radial filtcr 
function. 

A projected NFW profile extends to infinity, so we are 
required to normalize the filter taking into account the 
cutoff radius. An alte rnative radial profile suggested by 
iPostman et al.l (|1996| ) instead goes to zero at the cutoff 
radius: 

(25) 

The equation assumes R < R Cl where R c is the clus- 
ter radius as defined in Eqn. [H outs i de of this radius, 
P = 0. We follow iPostmanet ail (|1996f ) in setting 
-Rcorc = 100/i _1 kpc, and we normalize the filter as in 
Eqn. [5] This profile gives more weight to the central 
galaxies and less weight to the peripheral galaxies than 
the NFW filter. We denote the resulting richness, A pos t- 



Finally, we also investigate a third possibility, that of 
replacing the radial filter 2ttRYj(R) with a flat top-hat 
function, i.e. oc l/R or isothermal, which defines 

Aflat. This gives equal weight to cluster galaxies at the 
center and those at the periphery 

We perform a pairwise comparison of the scatter in 
Lx at fixed richness among our three richness estima- 
tors generated with three radial functions, Anfw, A pos t, 
and Aflat- For these tests, we use the best luminos- 
ity cutoff (0.2L*), the color filter including tilt, and 
fix the cutoff radius at 0.9/i _1 Mpc. In the compar- 
ison of the flat profile to the NFW profile, we find 
r = ^ixlAfutM-xlANPw = 1.03 ± 0.015. Thus, using the 
NFW profile is better than using a flat radial profile at 
the 2cr level. Comparing the Postman profile to the NFW 
profile, we find r = <rL x \\ pms JvL x \\ n w = 1-016 ±0.016. 

The NFW profile gives slightly more weight to periph- 
eral cluster galaxies than the Postman filter, and less 
weight to the peripheral galaxies than the flat filter. The 
NFW filter, somewhere in the middle, narrowly outper- 
forms the other two, which are closer to the extremes. 
That said, it is worth noting that up to sa 30% number 
of BC Gs in the maxBCG cat alog may not be at the halo 
center \ Johnston et al.ll2007D . Therefore, we cannot rule 
out the possibility that our results are driven at least 
in part by miscentering inherent to the maxBCG cata- 
log. Nevertheless, our tests in this section show that the 
shape of the radial profile has a relatively weak effect on 
the fidelity of the richness estimator A. Thus, we do not 
feel it is necessary to expend further energy in exploring 
a broad variety of possible radial filter functions. 

4.6. Luminosity Weighting 

Total optical luminosity of a cluster has been sug- 
gested as a superior mass tra cer to simply galaxy count- 
ing (e.g IPopesso et all I2005L and others), though any 
such improvement is likely to be small as the optical lu- 
minosity and t otal number of cluster galaxies are highly 
correla ted (e.g. IPopesso et al.l [20051 l2007t iKoester et al.l 
l2007a|) . In this section we explore this possibility by us- 
ing the total optical luminosity of red sequence galaxies 
in clusters as an X-ray luminosity tracer. 

We have already seen that our A formalism naturally 
produces a red-sequence based cluster membership prob- 
ability. Consequently, we can readily estimate the total 
red-sequence cluster luminosity L\ via 

i 

where L\ is the luminosity weighted lambda, pj is the 
membership probability of galaxy j, and Lj is the lu- 
minosity of galaxy j. The luminosity is defined as the 
i-band luminosity of a galaxy at z = 0.25, and all galax- 
ies are /c-corrected assuming the galaxies are red galaxies 
at the redshift of the cluster. We calculated L\ for our 
clusters using the the best luminosity cutoff (0.2L*), the 
color filter including tilt, and with a fixed 0.9/i _1 Mpc 
aperture. We find a scatter of <7i n l x \l x = 0.68 ± 0.02, 
and a redshift evolution parameter 7 = — 1.0±,0.6. The 
bootstrap comparison to the corresponding A estimate 
results in r = <7i n L x \L x /°~inL x \\ = 1.09 ± 0.02. Thus, 
calculating the red-sequence luminosity of the clusters is 
a worse tracer of Lx than pure red-sequence counts. 
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One worry when interpreting this results is that the 
result may be systematics driven. Specifically, in esti- 
mating the cluster luminosity we assume all galaxies are 
at the cluster redshift, which can dramatically boost the 
luminosity assigned to any foreground galaxies. Even 
with low membership probabilities, such boosts might 
significantly bias the estimated cluster luminosity. To 
test this hypothesis, we have repeated our experiment 
while restricting the sum in Eqn. [25] to galaxies fainter 
than the BCG. We find that our results are robust to 
this change, which suggests foreground interlopers are 
not the culprit. There is the additional possibility that 
the more numerous faint galaxies, with larger photomet- 
ric errors, are increasing the noise. However, the same 
trend — that simple number counts are superior to lumi- 
nosity weights — appears with richnesses using brighter 
luminosity cuts. Thus, we conclude that the total red- 
sequence cluster luminosity is a worse Lx tracer than 
red-sequence galaxy counts. 

Another type of luminosity weighting that has been 
suggested in t he past is weighti ng by the luminosity of 
the BCG (e.g. iReves et al.ll2008D. In Paper I, we showed 
that the richness estimate of lReye7 et all (|2008| ). while 
superior to -/V200, has a significantly larger scatter than 
Ao- Similarly, in the preset work we did not observe any 
significant difference when combining A with the BCG 
luminosity This may be simply a fact that our tests 
are primarily focused on the high richness end. It is 
clear that at sufficiently low richness — e.g., A = 1, or 
a single red galaxy — the luminosity of the BCG must 
contain additional information about the mass of the host 
halo. We emphasize that t hese results do not contradict 
those of lReves et all (|200l . The reason is that although 
significant improvements can be made relative to N200, 
these are not as easily achieved relative to our optimized 
A richness estimator. 

4.7. Aperture Optimization 

Having explored ways in which A could be improved 
while relying on a fixed metric aperture, we now turn to- 
wards optimizing cluster radii. Throughout this section, 
we use our final set of filters, given by 

u(c, to, r\z, A) = [2TrKE(R)}<l>(m)G(c, to), (27) 

where is given by Eqn. [51 4>{m) is given by Eqn. 1101 
and G(c, to) is given by Eqn. 1211 In addition, our lumi- 
nosity filter now extends to a luminosity cutoff L cut = 
0.2L*. For the rest of this paper, we denote the matched 
filter richness estimate from this filter simply as A. We 
proceed now to optimize the radial scaling parameters 
Ro and /3 from Eqn. 01 

We optimize the radial aperture using the procedure 
laid out in detail in Paper I. Briefly, we begin by defin- 
ing a coarse grid in R and (3, and estimate the scatter 
o~\ n L x \\ along this grid. Once we have a rough idea of 
where the minimum lies in parameter space, we repeat 
this search using a finer grid centered on the expected 
minimum. We then use bootstrap resampling to esti- 
mate the lcr and 2er confidence contours of the location 
of the scatter minimum in the Rq — ft plane. For a more 
detailed description of this algorithm, we refer the reader 
to Section 4 of Paper I. 

Figure [3] shows our final set of contours. The mini- 
mum on the fine grid is consistent with the coarse grid, 
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Fig. 3. — Contours of the probability density of the location 
of the point in the Ro — /3 plane that minimizes the scatter in 
Lx at fixed richness. The lines show the lcr and 2a contours. 
In the interest of simplicity, we choose as our final fiducial pa- 
rameters Ro = 1.0 /i -1 Mpc and /3 = 0.2 in the scaling relation 
Rc = iJo(-VlOO)' 3 , which are consistent with the minimum scatter. 
We note that these parameters yield R c ~ 0.9 h Mpc for a clus- 
ter with A = 45, the median richness of the richest 2000 clusters, 
and thus the reference aperture of 0.9 h^ 1 Mpc is consistent with 
the scaled aperture. 



with radial scaling parameters of Rq — 1.0 ft. -1 Mpc and 
(3 = 0.2 for use in the scaling relation R c = i? (A/100) /3 . 
These values are in good agreement with the trends seen 
in Paper I. Although the lcr contour is closed, there is 
a broad degeneracy region that extends down to a fixed 
metric aperture with Rq = 0.9 hr l Mpc and (3 = 0.0. In 
particular, for a cluster with A = 45, the median rich- 
ness of the richest 2000 clusters, the scaled aperture is 
~ 0.9 hr 1 Mpc. Therefore the reference fixed aperture of 
0.9 ft. -1 Mpc is consistent with the scaled aperture, and 
the richness comparisons in the previous sections are in- 
deed valid for our final richness estimator. However, as 
discussed in Paper I, the existence of the degeneracy line 
shown in Figure [3] is largely driven by the limited rich- 
ness range that we can probe using X-ray luminosity. We 
fully expect the variable aperture approach is superior to 
a fixed metric aperture, particularly when probing lower 
richness systems, although those systems are out of reach 
for our present analysis. 

For the radial scaling parameters of Rq = 1.0 h^ 1 Mpc 
and /3 = 0.2 used in the final version of A, the scatter 
in L \ for the top 2000 clusters is er lnix | A = 0.63 ± 0.02, 
and the evolution parameter is 7 = 0.5 ± 0.5. These 
values are consistent with those estimated for the fixed 
0.9 ft. -1 Mpc aperture, and the relative improvement is 
not significant, with r = 0.99 ± 0.02. 

5. SYSTEMATICS 

In this section, we discuss several sources of systematic 
error that may have an effect on the calculation of the A 
richness estimator. Our goal is to show that our method 
is particularly robust to common perturbations, and fur- 
thermore will produce consistent results even when we 
are not using SDSS data. In particular, in Section 15.11 
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we explore how A changes if alternative filters arc used 
to isolate the red sequence. In Section l5~2l we look at the 
effect of the normalization of the global background. In 
Section [5~3l we look at variations in the width of the red 
sequence in the model, and in Section l5~4l we look at the 
effect of uncertainty in the photometric zero-point. In 
Section F5T51 we investigate the effects on the calculation 
of A if we do not know the color-redshift relation. Fi- 
nally, in Section [5751 we determine the effect of "catalog 
noise," false galaxies that add noise to the input galaxy 
catalog. 

5.1. The Robustness of X to the Choice of Filters 

Up until now, we have focused our analysis on the 
color- magnitude relation as described by g — r v. i. As 
discussed above, the g — r color is well suited for clus- 
ter photometric redshift and richness estimation in the 
0.1 < z < 0.3 range covered by the maxBCG catalog. 
However, this filter combination is not uniquely able to 
isolate the cluster red sequence and evolve smoothly with 
redshift. In order for the richness estimator A to be gen- 
erally useful, and to be extended to higher redshifts, it 
must be robust to changes in filters. In this section we 
use additional SDSS data to investigate how A changes 
with alternative filter choices. 

We have chosen to focus our analysis on three alterna- 
tive color combinations: g — i, r — i, and u — r. Three 
bands — g, r, and i — have high quantum efficiency in 
SDSS, and thus we do not need to worry about significant 
problems with photometry near the limiting magnitude. 
When using the u band data we limit the redshift range 
to z < 0.25 (see below). The colors g — i and u — r have 
the advantage of containing the g band which changes 
strongly as the 4000 A break moves with redshift. Al- 
though the r — i color evolves with redshift and allows 
us to pick out the red sequence, this evolution is not as 
strong as when combined with the g color, and thus we 
expect that A may not perform as well. 

To create the re d sequence mode l of the filter, we fol- 
low the method of lHao et~al"1 (|2009l see Section 5.1). For 
each cluster with Ao > 10 we take all galaxies within 
0.9 Mpc and brighter than 0.4L*. We use the error- 
corrected Gaussian mixture model to decompose the red 
sequence and blue cloud/background for the color of in- 
terest. We then fit a linear model to all galaxies with 
±2<T of the red sequence, yielding a slope and intercept 
at i = 17, mi7 an d &17 (as in Section |4.3|) . Next we bin 
the clusters in redshift bins of width 0.02 and calculate 
the mean (77117) and (&17). Finally, we fit a linear model 
as a function of redshift to obtain a simple functional 
form of (mn\z) and (bn\z) akin to Eqn. [23l This model 
is used to calculate the given richness within a fixed met- 
ric aperture of 0.9 ft. -1 Mpc. 

Figure [4] shows the comparison of A calculated with 
various color filters to the basic X g - r . In the top panel we 
compare X g -i, in the middle panel we compare A r _i, and 
in the bottom panel we show X u - r . The top two panels 
show all maxBCG clusters, and the bottom panel only 
those with z < 0.25 due to the smaller sensitivity in the 
u band. The inset histograms illustrate the distribution 
of differences between the given richness and A g _ r . 

Table [1] shows the results of comparing the richness es- 
timates to the baseline A 9 _ r . First, we have estimated 
the width of the distribution, AA rms , shown in in the in- 
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Fig. 4. — Comparison of A calculated with different filter combi- 
nations, and compared to the basic A 3 _ r . In the top panel is A g _i, 
in the middle panel is A r _i, and in the bottom panel is A u _ r . The 
inset histograms illustrate the distribution of <5 = A a i ter natc — -^9— r- 
Changing from the g — r color to the g — i color does not have a 
significant effect on the richness estimation. 



Table 1, Comparison of richnesses to benchmark A 9 _ r 



Richness 


AX 


r 


Xg-t 


-0.9 ±2.3 


1.01 ± 0.01 




-1.5 ±5.2 


1.11 ±0.03 


Xu — r 


-0.5 ±5.1 


1.12 ±0.03 



Note. — All richnesses are measured in a fixed metric aperture 
of 0.9 h^ 1 Mpc. AA shows the change relative to the benchmark 
A 9 _ r . 



set plots of Figure SJ We report A A rather than A A/ A 
because it is the former that is roughly constant with 
richness. Second, we have compared the scatter in o~l x \x 
using the bootstrap resampling ratio r described in Sec- 
tion 14.11 For the first two alternative versions of A we 
use the top 2000 clusters in the full redshift range; for 
A u - r we use the top 1000 clusters for the lower redshift 
range 0.1 < z < 0.25. 

It is readily apparent that the change from X g - r to 
Xg-i is nearly insignificant. The median richness of the 
top 2000 clusters is ~ 45, so the observed scatter is < 
4%. For reference, Poisson scatter at this richness would 
correspond to w 15% scatter, while iRozo et al.l (|2010af ) 
estimate the scatter in A200 at fixed mass to be ps 35%. 
In short, g — i is effectively as efficient as g — r for the 
purposes of selecting red-sequence galaxies, reflecting the 
fact that the 4000 A break falls within the g filter. 

Focusing now on the r — i and u — r filter combinations, 
we see these choices exhibit a significantly larger scatter 
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in AA, of order 10 %, still less than P oisson scatter, and 
much less than the lRozo et al.l (|2010af ) estimate for iNfaoo- 
This increased scatter relative to g — r is also reflected as 
increased scatter in Lx at fixed richness. In the case of 
r — i, this can be understood by the fact that our filter 
combination does not straddle the 4000 A break, and 
therefore the red sequence is not as prominent against 
the background. The u — r color does straddle the 4000 A 
break, but suffers from the much lower SDSS sensitivity 
in the u band. 

In summary, we see that an effective use of the A rich- 
ness does not require the specific g — r filter combination 
used in this work. Indeed, use of any filter combination 
that can effectively isolate the red-sequence will result in 
nearly identical and unbiased values for X, although the 
scatter is reduced when using high sensitivity filters that 
straddle the 4000 A break. Even in the case of the less 
optimal r — i and u — r combinations we get a reason- 
able, although not ideal, richness estimator. Due to the 
stability of A when using different filter combinations, we 
expect these methods to be easily generalizable for other 
telescopes and higher redshifts. 

5.2. Background Normalization 

With the entire SDSS DR7 at our disposal, covering 
~ 7500 square degrees, we can make a very accurate 
estimation of the global background as a function of color 
and magnitude. In other instances, however, there may 
be greater uncertainties in the true mean background for 
a given filter combination. In this section we investigate 
the effect of varying the background level on A. 

In general, uncertainties in the background will be 
a function of color and magnitude. For simplicity, we 
model uncertainty in the background as a boosting or 
de-boosting of the background normalization. Our in- 
tention with this approximation is to give a rough esti- 
mate of the systematic uncertainty in richness estimates 
given the uncertainty in the background. We expect that 
a boost of the background normalization 6(x) will cause 
the richness A to decrease by a fixed amount (indepen- 
dent of A) and a decrease of the background normaliza- 
tion will cause A to increase by a similar amount. 

In Figure [5] we show the effect of changing the back- 
ground normalization on the calculation of A in a fixed 
0.9 h^ 1 Mpc aperture. When the background normaliza- 
tion is deboosted by a factor of 0.5, then A changes by 
AA = 6.6±2.9, and when it is boosted by a factor of 1.5, 
then AA = — 4.2±1.7. For a moderate richness cluster of 
A ~ 30, incorrectly estimating the background by ~ 50% 
can bias the richness by ~ 20%, appropximately equal 
to the Poisson scatter. On the other hand, a ~ 10% 
error in the background normalization has a negligible 
effect on the richness estimation of ~ 3%. We emphasize 
these values are only meant to help in estimating how 
precisely the background must be modeled in order to 
ensure a negligible impact on richness estimates. 

5.3. Red Sequence Width 

In our adapted color filter from Section l4~3l and in all 
our tests, we fix the intrinsic scatter in the red sequence 
at dint = 0.05. We now investigate the effect on A and 
o~l x \\ on changes in the intrinsic scatter, similar to the 
tests of Section 4.3 in Paper II. For this test, we fixed 
the radial size at 0.9 h~ Y Mpc, and set the red sequence 
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Background Adjustment Factor 

Fig. 5. — Change in A (AA) as a function of background adjust- 
ment factor. When the background normalization is deboosted by 
a factor of 0.5, the richness estimate changes by AA = 6.6 ± 2.9, 
and when it is boosted by a factor of 1.5, then AA = —4.2 ±1.7. If 
one knows the background normalization to ~ 10% then the bias 
in the richness estimation is less than the statistical error. 



width to 0.03 mag and 0.07 mag, representing a narrow 
and wide extreme. 

In each case, we find that the effect on both the scatter 
and the richness estimate is negligible. For a mt = 0.05, 
then AA = -1.5 ± 1.0 and for cr int = 0.07, then AA = 
1.2 ± 0.9. Unsurprisingly, if we take a narrower (wider) 
red sequence then the richness estimate decreases (in- 
crease) as we lose (gain) galaxies at the margins. How- 
ever, the total probability of these additional galaxies is 
quite small, and the overall bias in the richness estimate 
is negligible. Note that one reason that A is not very 
sensitive to the red sequence width is because the color 
filter in Eqn.[l2]uses the sum in quadrature of the photo- 
metric error and intrinsic color dispersion. We conclude 
that our richness estimator is robust to changes in the 
red sequence width, and thus is suitable for other filter 
combinations that span the 4000A break that may have 
slightly different ridgelinc widths. 

5.4. Zero-Point Uncertainty 

The calculation of A requires measuring all the galaxies 
above a given luminosity threshold that evolves with red- 
shift. In practice, uncertainty in the luminosity threshold 
is equivalent to uncertainty in the photometric zero-point 
as well as systematic offsets in differing methods of calcu- 
lating galaxy magnitudes. In this section we explore the 
effect of an offset in the photometric zero-point, which is 
equivalent to each of these effects. 

As before, we run with a fixed 0.9 h~ x Mpc aperture 
with a range of systematic offsets in the magnitude limit 
for the luminosity function filter (Eqn. [T0|) . We find 
that for small zero-point shifts - up to ±0.05 mag - the 
effect on A is negligible (6 A < 1) for the vast majority of 
clusters, consistent with the observation in Paper II that 
such errors do not significantly impact the scatter of the 
richness-mass relation. This is because only rarely is a 
red sequence galaxy added or removed from considera- 
tion when making such a small shift. Of course, large 
photometric shifts in rich clusters can be significant. For 
instance, a ±0.1 mag shift in a rich cluster can change A 
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by ps 10%. Note, however, that such photometric errors 
are much larger than expected for upcoming photometric 
surveys. 

5.5. Uncertainty in the Color- Redshift Relation 

In this section we investigate the effect of uncertainty 
in the color-redshift relation for the Gaussian color fil- 
ter. If we were to follow the strategy of the preceding 
sections, we would approach this issue by systematically 
shifting the slope and intercept of the color filter. How- 
ever, this treatment would unrealistically ignore the data 
at hand. When observing an actual galaxy cluster, we 
can always measure the red sequence for each individual 
cluster. Therefore, we explore the effect on A if we mea- 
sure the red sequence intercept and slope for each cluster 
individually. 

For each maxBCG cluster, we first take all the galaxies 
within 0.9 h^ 1 Mpc and brighter than 0.2i». To measure 
the intercept and slope of the red sequence, we follow the 
method of Sections 14.21 and 14.31 The color distribution 
of galaxies is decomposed into two Gaussians (the red 
sequence and the blue/background galaxies) using the 
Error Corrected Gaussian Mixture Model of iHao et all 
(|2009f) . To limit the de grees of freedom to allow for accu- 
rate fitting of relatively poor clusters (A < 30), we fix the 
width of the red sequence component at crj nt = 0.05 mag. 
One could in principle also attempt to fit this from the 
data, but we have already shown that the recovered rich- 
ness is largely insensitive to this parameter, and the typ- 
ical ridgeline width is ps 0.05. If the mixture model is 
unable to identify two distinct components then the clus- 
ter is flagged and A is not measured. We then take all 
galaxies within 2<7i nt and fit the red sequence slope and 
intercept. These values are then substituted for Eqn[23l 
to calculate the richness A. 

Figure |6] shows the comparison of Agt j where we fit for 
the red sequence on a cluster-by-cluster basis, to A in 
a fixed 0.9 hr 1 Mpc aperture. Results are nearly identi- 
cal with a variable aperture with Rq = 1.0 hr 1 Mpc and 
/3 = 0.2. The inset plot shows that for individual clusters 
A shifts by a negligible 0.1 ± 0.6. Overall, the richness 
measurement is extremely robust to perturbations in the 
red sequence location. This is due to a combination of 
the contrast of the red sequence with the background 
galaxies, and the fact that the smooth Gaussian color 
filter is more tolerant of color offsets than a top-hat filter 
(as explored in Section 6 of Paper I) . It should be noted 
that our automated algorithm for red-sequence fitting 
succeeded in measuring the red-sequence for all clusters 
of richness A > 50. At A < 40, the failure rate was 
ps 10%. We are confident that this failure rate could be 
decreased if one incorporated even mild priors on the red- 
sequence parameters, for instance from population syn- 
thesis models. The main point here is not the automated 
red-sequence fitting, but rather the fact that so long as 
the red-sequence can be properly fit from the data, one 
does not need an a-priori model for the red-sequence in 
order to be able to compute A. Of course, having such a 
model will help in the limit of low signal-to-noise, and is 
less computationally intensive. 

5.6. Catalog Noise 

As discussed in Section 12.31 a clean input catalog is 
required for accurate cluster finding and richness estima- 
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Fig. 6. — Plot of Afiti using the red sequence fit for each individual 
cluster, against A, our optimized richness, with a fixed 0.9 h~ 1 Mpc 
aperture. The inset shows the histogram of AA, which has a neg- 
ligible offset and scatter of 0.1 ± 0.6. However, at A < 40, about 
10% of the clusters are not measured as we were unable to prop- 
erly decompose the red sequence component from the background 
using the ECGMM method. 



tion. The old mantra "garbage in, garbage out" is espe- 
cially apt. We refer to the inclusion of any object that 
is not a correctly measured galaxy as "catalog noise." 
This includes stars, asteroids, image artifacts, and im- 
properly measured photometric errors. In general, it also 
includes pipeline-to-pipeline variations in the reduction, 
detection, and photometry for the same object. In this 
section, we obtain a first-order estimate of the magnitude 
of the effect of catalog noise on our richness estimation. 

To test the effect of catalog noise, we compare the scat- 
ter in Lx at fixed richness for two versions of the input 
catalog. The first is the "clean catalog" that has been 
filtered as described in Section 12.31 The second is the 
raw catalog, using all the objects marked as galaxies in 
DR7 without any additional filtering. Figure [7] shows 
the histogram of the number of objects as a function of 
i-band magnitude for stripe 10 in SDSS. There are sig- 
nificantly more galaxies at the faint end in the uncleaned 
catalog (red dashed histogram), many of which arc false 
detections. 

For each input galaxy catalog, we first compute the 
background as described in Section 13.41 We then com- 
pute both Ao, the original matched filter richness, and 
A, the optimized richness, both at the fiducial fixed 
0.9 ft. -1 Mpc scale. We expect that the effect of catalog 
noise will be greater on A, as it uses a deeper luminosity 
cut where the catalog noise is greater. This is indeed 
what we find for the top 2000 clusters, where the scatter 
in the Lx at fixed richness increases from 0.70 ± 0.02 to 
0.72 ± 0.02 for A , and from 0.63 ± 0.02 to 0.66 ± 0.02 for 
A. Using the bootstrap resampling ratio r described in 
Section ^. li we find that using the noisy catalog increases 
the scatter by r = 1.03 ± 0.02 for A and r = 1.04 ± 0.02 
for A. In addition, wc confirm that the scatter measured 
is consistent between the cleaned DR7 catalog and the 
cleaned DR4 catalog used in Paper I. Thus, at least for 
different versions of the SDSS pipeline A is robust to this 
level of pipeline-to-pipeline variations. 

Overall, with our current catalog and our best richness 
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Fig. 7. — Histogram of number of galaxies in SDSS stripe 10 from 
DR7. The solid black histogram shows the galaxies as a function 
of z-ba nd m agnitude for the clean input catalog as described in 
Scction l2,3l The dashed red histogram shows the raw input catalog 
from DR7, without the additional flag cuts. There are significantly 
more galaxies at the faint end, most of which are false detections. 



estimator, we can detect the effect of catalog noise at 
the 2cr level in our chosen figure of merit. Therefore, 
catalog noise is a nuisance, though it is not a critical 
path item. This is partly due to the high quality of the 
raw SDSS DR7 catalog. However, we emphasize that we 
can detect this effect even though we are recomputing 
the background for each input catalog, and these false 
galaxies are presumably not correlated with the cluster 
positions. Thus, increased noise in the background does 
translate to increased scatter in the richness estimator, 
and should be controlled as well as possible. 

6. THE ORIGIN OF THE OPTIMAL RADIAL AND 
LUMINOSITY CUTS 

We have demonstrated the existence of optimal ra- 
dial and luminosity cuts when evaluating the richness 
of maxBCG galaxy clusters. We have not yet, however, 
offered an explanation for the origin of these cuts. In- 
deed, in a naive Poisson scatter model outlined in Paper 
II, one should expect larger apertures and fainter mag- 
nitude cuts to always result in reduced scatter simply 
due to the larger galaxy count. What then changes these 
conclusions? 

We address these questions by relying on the simula- 
tion method from Paper II, which we now briefly sum- 
marize. We model galaxy clusters using an NFW profile, 
with a Schechter luminosity function, and a Gaussian 
color distribution. Each cluster realization is then em- 
bedded in a realization of a uniform density field meant to 
represent the local galaxy background. The background 
density field can be set to the mean galaxy density of 
the universe, which we refer to as uniform background, 
or it can be modeled so as to match both the mean and 
variance of the local density field around SDSS maxBCG 
galaxy clusters. We refer to this latter model as the ran- 
dom background model, since each cluster is embedded 
in a different background. 

There were two key insights from Paper II that are 
relevant for this discussion. The first is that the scatter 
in richness at fixed mass depends sensitively on miscen- 
tering parameters. Indeed, Paper II concludes that the 
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Aperture (h 1 Mpc) 

Fig. 8. — The scatter in richness at fixed mass in our Monte 
Carlo simulations for four different models, as labeled (see text 
for details on the models). We see that miscentering creates a 
floor below which the scatter increases rapidly, while projection 
effects in the realistic random background model (chosen to match 
SDSS data) pushes the optimal aperture inward. Thus, the optimal 
aperture reflects a compromise between these two sources of error. 
Note that even without miscentering we find an optimal aperture 
using the variable background model. The vertical dotted line is 
the optimal aperture from section II. 71 



scatter for maxBCG clusters is dominated by miscenter- 
ing. The second key insight concerns the local galaxy 
background within which clusters are embedded. Specif- 
ically, we find that the majority of clusters are embedded 
in a low density background, with 1% — 5% of the clusters 
embedded in high density backgrounds that results in a 
severe over-estimate of the clusters' richness. These rare 
occurrences were interpreted as the signature of projec- 
tion effects in CDM cosmologies. 

Given these two key insights, we consider whether 
these two effects — miscentering and the stochastic na- 
ture of the background galaxy density — give rise to the 
optimal aperture and luminosity cuts we have uncovered 
empirically. To do so, we perform Monte Carlo realiza- 
tions of galaxy clusters with four different models: 

1. No systematics (no variable background; no mis- 
centering) 

2. Miscentering (no variable background) 

3. Variable background (no miscentering) 

4. Miscentering and variable background 

For each of these four models, we generate between 400 
and 5000 Monte Carlo realizations of galaxy clusters 21 , 
and then measure the richness using a variety of radial 
aperture and luminosity cuts. This data is then used 
to estimate the scatter in richness, which we plot as a 
function of the radial and aperture cuts. 

Figure [5] shows how the scatter in richness at fixed 
mass varies for each of the four models detailed above. 
For models with miscentering, we assumed 80% of the 
clusters are centered correctly, while the remaining 20% 
are radially offset by first picking a random axis, and 

21 See Paper II for details on the construction of the realizations 




Luminosity cut (L.) 

Fig. 9. — The scatter in richness at fixed mass in our Monte 
Carlo simulations for four different models, as labeled (see text 
for details on the models). In miscentcring models, lowering the 
luminosity cut tends to lower the scatter, but only up to a point. 
The vertical dotted line in the figure is the luminosity cut below 
which we did not see any significant improvement in the data. 
Thus, it is likely that the flattening of the luminosity the scatter 
in Figure [2] primarily reflects the miscentcring properties of the 
maxBCG catalog. 



then displacing along this axis by randomly drawing 
from a Gaussian of me an zero, and standar d deviation 
a = 0.4 h~ 1 Mpc (e.g.. Uohnston et alJl2007t ). The ver- 
tical dotted line is the optimal fixed metric aperture of 
0.9 h^ 1 Mpc that we found in the data. Not surprisingly, 
miscentcring requires that the optimal aperture be signif- 
icantly larger than the miscentering kernel, creating an 
aperture floor below which the scatter increases rapidly. 
In the other direction, moving outward is penalized when 
estimating the richness of clusters in the (realistic) ran- 
dom background model. This is easy to understand: if 
the background density is high, the larger the aperture, 
the larger the noise in the richness estimate of those clus- 
ters suffering from projection effects. The optimal aper- 
ture is therefore a compromise between these two sources 
of stochasticity. Note, however, that even without mis- 
centering we find an optimal aperture using the variable 
background model, which simply reflects the fact that 
as the aperture goes to zero, the scatter necessarily in- 
creases since there are fewer cluster galaxies. 

Figure [9] illustrates how the scatter in richness at fixed 
mass varies in our Monte Carlo simulation as a function 
of the luminosity cut employed when estimating cluster 
richness. The vertical dotted line corresponds to the opti- 
mal luminosity cut from section l4~4l Even in the absence 
of miscentering, it can be seen that the gains when using 
a deeper luminosity threshold than 0.2L* are marginal. 
However, miscentering introduces an additional "scatter 
floor," and once this floor is reached, reduction in the 
scatter is no longer possible. We expect that when us- 
ing cluster catalogs with improved centering properties 
further reduction in the scatter should be possible. Ad- 
ditional gains may be made by lowering the luminos- 
ity threshold L cut , although the rate of improvements is 
rather low. Of course, at some point a new scatter floor 
has to arise from other effects (e.g. triaxiality), but these 
effects arc not dominant in the maxBCG catalog if the 
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miscentering model of iJohnston et all ([20071 ) is correct. 

7. SUMMARY AND CONCLUSIONS 

In this paper, we have shown the improvements in the 
matched filter richness estimator presented in Paper I. 
By gauging improvement by the decrease in scatter in 
X-ray luminosity at fixed richness, we can quantitatively 
determine which optical proxies are superior as a tracer 
of halo mass. Our final optimized richness A uses a proba- 
bilistic formalism to estimate the number of red sequence 
galaxies brighter than 0.2L* in the cluster. Wc empha- 
size that our goal is to find a high fidelity mass tracer, 
and we leave an analysis of the complete census of c luster 
galaxies to separate work (e.g.. lHansen et al"1l2009l ). 

Relative to the matched filter richness described in Pa- 
per I wc find: 

1. Lowering the luminosity threshold results in de- 
creased scatter, but only as far as 0.2L*. Using 
Monte Carlo simulations, we show that this limit 
is partially driven by the miscentering properties of 
the maxBCG catalog. Consequently, catalogs with 
improved centering properties may benefit from go- 
ing even deeper. However, even without miscen- 
tcring, the rate at which the scatter is reduced is 
rather modest, so such benefits are likely to be lim- 
ited. 

2. Modifying the color filter to account for the blue 
galaxy population (which makes up > 20% of the 
galaxy population) results in increased scatter. We 
are unable to determine whether the increased scat- 
ter is intrinsic (e.g., the blue galaxies have more 
recently fallen into the cluster), or if it is simply 
caused by the fact that the blue galaxies arc much 
less prominent against the background, yielding a 
noisy measurement. Either way, generalizing color 
filters to include blue galaxies is inadvisable for 
photometric catalogs. 

3. Weighting each galaxy by its luminosity results in 
increased scatter, and weighting each clus ter by the 
BCG luminosity (e.g.. iReves et al.ll2008t) does not 
improve the scatter. However, our tests only probe 
the high richness end, and the possibility of further 
improvements at low richness, where the luminosity 
of the BCG is more dominant, are not ruled out. 

4. Incorporating red sequence tilt does not have a 
measurable impact on the recovered scatter. Nev- 
ertheless, wc have modified our estimator to include 
this tilt, both because it may become relevant for 
fainter luminosity cuts, and because we expect the 
tilt to become more important at higher redshifts 
with different filter combinations. 

Following Paper I, we also optimized the radial aper- 
ture used to estimate cluster richness. Our best fixed 
metric aperture is 0.9 Mpc, though we expect scaled 
apertures should be better due to the standard "bigger 
things arc bigger" maxim. Assuming a power-law rela- 
tion between radial cutoff and cluster richness, we find 
that R c = 1.0(A/100)°- 2 /i _1 Mpe. Although we cannot 
test the scaling relation at low richness, we expect the 
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scaled aperture to be superior to the fixed metric aper- 
ture further down the richness function. Following Pa- 
per II, we used Monte Carlo simulations to demonstrate 
that the optimal aperture we have measured reflects a 
compromise between cluster miscentering and projection 
effects; cluster miscentering creates a hard floor on the 
minimal aperture, while projection effects push towards 
smaller apertures. Even in the absence of miscentering, 
simple counting statistics (smaller apertures find fewer 
galaxies) combined with projection effects would yield a 
similar optimized aperture. We also investigated whether 
the shape of the radial filter can result in improved rich- 
ness estimators, but found that the detailed shape of this 
filter has only a very modest impact on the recovered 
scatter. 

Our work is most comparable to that done using 
the X-ray selected R ASS-SDSS galaxy cluster catalog 
(|Popesso et al.ll2004l ). who first explored the idea of us- 
ing X-ray data to improve and calibrate optica l rich- 
ness/luminosity estimators. iPopesso et al.l (|2004f ) found 
an optimal aperture significantly smaller than the ss 
0.9 h~ x Mpc value advoca ted here. This is not surpris- 
ing. IPopesso et al.l (|2004D did not rely on red-sequence 
galaxy selection, which lowers the density contrast of 
galaxy clusters. Consequently, we expect the optimal 
aperture to move in wards. It is also worth noting that 
IPopesso et al.l (|2004l ) report a scatter in Lx at fixed op- 
tical luminosity L opt of <J\ n Lx\L op t = 0.41, which is much 
smaller than what we have a c hieve d. We caution, how- 
ever, that the IPopesso et al.l (|2004f ) value has not been 
corrected for selection effects. Indeed, they also report 
®~ in L pt | l \ = 0.46, with a scaling of L opt <x L x with 
a = 0.45. For power-law abundance functions, applica- 
tion of Bayes's Theorem relates these two scatters via 

°~lnL opt \L x = aO-\ nLx \ Lopt . (28) 

That this equality does not hold for lPopesso et al.l (|2004f ) 
analysis is a direct consequence of having neglected se- 
lection effects. As a rough estimate, we expect for the 
scatter o~\ n L opt \L x t° more robust to X-ray selection 
— one finds all clusters of a given X-ray flux, but not all 
clusters of a given L opt — so we can use can use equa- 
tion [2H to estimate the corrected scatter in Lx at fixed 
L opt . Wefind a ln£ | Lopt = 0.46/0.45 = 1.02. Given that 
IPopesso et al.l (|2004[) did not use color information when 
estimating optical luminosity, it is not surprising that the 
corrected scatter would be this large. 

The scatter in X-ray luminosity at fixed richness for 
our final richness estimator is <7i n L x |A = 0-63 ± 0.02. As 
was argued in Paper II, this scatter is likely dominated by 
the miscentering properties of the maxBCG cluster cata- 
log, rather than by intrinsic scatter in the richness-mass 
relation. Consequently, we expect that making improve- 
ments to the centering algorithm used in cluster finding 
may result in a further reduction of the scatter in Lx at 
fixed richness. 

We also performed extensive tests on the robustness of 
the richness estimator A to the details of the measure- 
ment. We find that our richness estimator is robust to 
various modifications, including: 

1. It is robust to the choice of optical bands used 
for color selection, provided the bands straddle the 
4000 A break. 



2. It is robust to changes in the overall background 
normalization, for changes < 50% for the richest 
clusters. 

3. It is robust to moderate changes in the intrinsic 
width of the red sequence. 

4. It is robust to uncertainty in the photometric zero- 
point up to ±0.1 mag. 

5. It is robust to uncertainty in the color-redshift re- 
lation. In particular, consistent results can be ob- 
tained by fitting the red-sequence directly for each 
individual cluster as are obtained for the global 
model. 

The uncertainty associated with most of these effects is 
A A w 1 — 2, which is significantly smaller than the intrin- 
sic scatter of the richness-mass relation. Consequently, 
one can implement our optical richness estimator regard- 
less of the details of the optical data at hand, and be con- 
fident that the resulting richness estimates can be fairly 
compared to those from other data sets. Appendix [X] 
contains a summary of how to implement our richness 
estimator A. Finally, in order to try to improve its use- 
fulness, we provide a preliminary mass-richness relation 
for A in Appendix [EJ We emphasize that this mass cali- 
bration is preliminary, and that a robust calibration with 
well understood error must await for future work. 

We believe that the method for calculating A is close 
to an optimal richness estimator for photometric cata- 
logs, while the parameters described here are optimized 
for this particular cluster sample. Importantly, this esti- 
mator can be applied irrespective of the cluster selection 
algorithm, while its robustness to the details of the im- 
plementation ensure that one can fairly compare differ- 
ent data sets. Moreover, the detailed understanding we 
have gained on how to properly estimate cluster richness 
and galaxy membership of galaxy clusters can help guide 
cluster finding efforts. Indeed, we are currently develop- 
ing such a cluster finding algorithm. In short, we are 
confident that the detailed studies we have performed in 
this context will prove to be of critical importance for 
maximizing the cosmological utility of upcoming optical 
surveys such as the Dark Energy Survey, Pan-STARRS, 
and Hyper-SuprimeCam. 
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APPENDIX 



A USER FRIENDLY GUIDE FOR IMPLEMENTING THE RICHNESS ESTIMATOR A 

In this appendix, we lay out the "recipe" for implementing the richness estimator A. First, we list the "ingredi- 
ents" that are necessary, followed by the "cooking instructions." As described in Section [SJ several substitutions for 
certain ingredients can be made without a significant effect on the richness parameter. We make note of the possible 
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substitutions below. 

Ingredients 

1. All galaxies within ~ 1.25 /i -1 Mpc of the cluster center. The richest maxBCG cluster has A = 216 and R c = 
1.17 h~ l Mpc, so this is the largest size that is needed. 

2. If the cluster redshift z is between 0.1 and 0.3, you will need the g — r color (c) of each galaxy. As described 
in Section 15.11 any pair of sensitive bands that straddle the 4000 A break may be substituted. Similarly, if you 
work at higher or lower redshift, you will need to change your filter appropriately. 

3. The z-band magnitude of each galaxy, or a suitable red filter. 

4. The fc-corrected value of 0.2L* for the band in question. For SDSS i-band and 0.05 < z < 0.35, m*(z) is well 
described by Eqn. II H and the luminosity cut is given as m* + 1.75 mag. 

5. The mean galaxy background as a function of color and magnitude (see Sections 13.41 and |5.2|) . 

6. The ridgeline slope and intercept, as described in Eqn. [23] Alternatively, you can fit for the ridgeline slope and 
intercept with o"; nt = 0.05 from the list of galaxies. 

"Baking" Instructions 

1. Calculate the luminosity filter value for each galaxy, 4>(m), as described in Eqn. [10] in Section [3~2l Note that the 
filter is normalized such that the integral from m = — oo to m cut is exactly one. 

2. Calculate the radial filter value for each galaxy, 2irRY,(R), as described in Eqn. [B] in Section |3~T1 The radial 
filter needs to be normalized by integrating R c as per equation [5] Thus, the amplitude of is a function of 
R Cl with the normalization constant for the particular implementation of the NFW profile adapted in this work 
given in Eqn. [9] 

3. Calculate the color filter value for each galaxy, G(c,m\z), as described in Eqn. [51] in Section |4~B"1 

4. Calculate the background filter value for each galaxy, b(m,c) = 27riffi g (m, c)/C(z) 2 . We have defined Y> g (m, c) 
as the mean galaxy background in iV/sq.deg./mag/mag, and the conversion factor C{z) is given in degrees/Mpc 
at the redshift of the cluster. 

5. Using a zero-finder (e.g., bisector or Newton's method), solve the following equation: 

= A- Yl Pi ( Al ) 

R<R C (X) 

where 

= \2irR l Z{R l )cj ) {m t )G{c l ) 
pl X2^R i Yl(R i )ct>{mi)G{c i ) + b{m i ,c i ) [ ' 

and 

R C (X) =i? (A/100.0) /3 . (A3) 

Note that once the root A of equation IA1I has been found, equation IA2I can be used to estimate the membership 
probability of every galaxy. 

6. To estimate the statistical error in A, use equation [3] This measurement error is typically very small, of order 
AA/A = 0.4A- 1 / 2 , or « 4% (9%) for A = 100 (15). 

Sample IDL code that will calculate A for SDSS data is available at http://kipac.stanford.edu/maxbcg/ 

THE MASS-RICHNESS RELATION 

The goal of this paper was to extensively test and optimize the richness measure proposed bv iRozo et~aT1 (l2009bft . 
A careful calibration of the mass-richness relation is beyond the scope of the current paper, though we do intend 
to address this problem in subsequent work. Nevertheless, we felt it was important to provide a rough calibration 
that may be used for comparison purposes, and to test the efficacy of our estimator. To this end, we have relied 
on abundance matching techniques. Briefly, we compute the cumulative cluster abundance function A r c i uste rs(> A), 
defined as the number of maxBCG clusters of richness A or higher . We then al s o com pute the expected cumulative 
mass function Ah a ios(> tn) as a function of mass by integrating the lTinker et all (|2008| ) mass function for our fiducial 
cosmology over the maxBCG survey volume. 22 When computing the mass function, we adopt as our fiducial mass 
definition Af200m, the mass contained with a 200 overdensity relative to the mean matter density. A cluster of richness 

22 maxBCG clusters probe the redshift region z £ [0.1,0.3] over a survey area f2 = 2.25356 srad. 
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Fig. BIO. — Left: The scatter in mass at fixed richness, as estimated via equation IB2I The scatter is estimated using all clusters above 
a given richness, as opposed to using narrowly binned samples. The triangles with error bars show the recovered scatter assuming r = 0, 
while the dashed lines illustrated how the scatter changes as we vary the correlation coefficient between Lx and A at fixed mass. The 
small vertical solid line along the x axis marks the richness threshold for the top-2000 clusters. Below this richness, we expect the scatter 
is co mpro mised due to the original maxBCG selection. Right: Mass as a function of A obtained via density matching, scaled to Msorjc 
(Eqn. IB6I I. Dotted lines are the expected 90% scatter contours obtained from the X-ray constraints. For r eference, X-ray ma sses from 
two cluster samples that overlap the m axBCG footprint are ove rplotted. Blue squares are from BCS sample {Mantz ct al. 20H3), and red 
triangles are from the LoCuSS sample (Zhang ct al. 2008, 2010). Sec text for discussion of the possible origin of the normalization offsets 
between the two data sets. In each case, the observed mass scatter agrees well with the predicted mass scatter. 



A is assigned a mass M by solving the equation iVciusters(> A) = A r h a ios(> M) for M(A). We call this mass estimate 
the density mat ching mass, denoted Md m (A). 

As detailed in iMortonson et all (|2011h . mass estimates from density matching are expected to be biased high relative 
to the Bayesian posterior by an amount equal to \~f<J 2 in the log. Here, 7 is the slope of the halo mass function at 
mass M, and a is the scatter in mass at fixed richness. Thus, in order to correct for this bias, we must first estimate 
the scatter in mass at fixed richness for A. To do so, we rely on the scatter in Lx at fixed richness. Assuming 
hiLx = a + ahxM, it follows that a scatter in mass <Jm\\ corresponds to a scatter in Lx given by u<jm\n- If Lx and 
A arc uncorrelatcd, then on simply needs to add in quadrature the intrinsic scatter in Lx at fixed mass, (Jl x \Mi to 
arrive at the total scatter in Lx at fixed richness: 

a l x \\ = a L x \M + a L x \M (T M\\- ( B1 ) 

With this equation, one can straightforwardly solve for <tm\\- I n the slightly more general case when Lx and A are 
correlated, one finds 



pm\\ = ra M \L x + 



~Z2 u m\l x 
L x \ M 



-,1/2 

(B2) 



In the above equation, r is the unknown correlation coefficient between the richness A and Lx at fixed mass, while 
a L x \M is the sl ope of the Lx — M relation. We adopt the fiducial values Q-l x \m = 1-61 and <Jm\l x = 0.246 as per 
iVikhlinin et al.l (|2009() . As for the correlation coefficient r, because the scatter in Lx is dominated by emission from 
the core, we do not expect A and Lx to be strongly correlated. Here, we simply consider three values for r, r = ±0.3, 
and r = 0. Setting the scatter (Jl x \\ = 0.63 as appropriate for our top 2000 clusters, we arrive at ctm\\ = 0-31+ ; 07 , 
where the error bars reflect the change in the scatter for r = ±0.3. Figure [Biol (left panel) illustrates how our recovered 
scatter <7m\\ changes as a function of the minimum richness A of the sample under consideration. 

One interesting feature of the scatter in mass at fixed richness as a function of A is that the the scatter appears to 
increase slowly with decreasing richness for A > 60, but begins to climb much faster below A < 60. Remarkably, in 
Paper II we found that A « 60 is the richness at which the miscentcring of maxBCG galaxy clusters is expected to 
become important. Moreover, as we argue in Paper II, cluster miscentering "turns-on" very quickly. Consequently, it 
is possible that the rapid rise of the scatter with decreasing richness below A < 60 does not reflect the true intrinsic 
scatter of the estimator A, but is rather a reflection of the miscentering properties of maxBCG clusters. Above A > 60, 
however, we do not expect cluster miscentering to play as significant a role. Thus, it is likely that the observed scatter 
a t A > 60 is close to the intrinsic scatter of our richness estimator. 

Based on the results shown in Figure IBIOI (left panel), we adopt a fiducial scatter (Tm\\ = 0.25. With this scatter in 
hand, we can correct the density matched masses by the expected bias. If A/dm (A) is the density matched mass of a 
cluster of richness A, we set the cluster's final mass to 

M(A) = cxp (-\l{M dm {\))al nx \ M dm (A) (B3) 
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where 7(Md m ) is the slope of the halo mass function dN/dlrxM oc M~ 7 evaluated at the density matching mass Md m . 
Having assigned a mass to every cluster in this fashion, the resulting mass-richness relation is fit to a power-law using 
all clusters of richness A > 60. We restrict ourselves to these clusters given that we believe cluster miscentering might 
be starting to become important below this richness. We finally arrive at 

ln ( u-UnT*r ) =1 - 72 + lm H^O). (B4) 
\ft 70 1U MqJ 

Note that we have scaled the masses relative to h = i/o/lOOkms -1 Mpc" 1 = 0.7, and we have made explicit that this 
scaling relation is appropriate for a mass overdensity of 200 relative to the mean matter density. Equation IB4I is our 
proposed scaling between mass and richness, while <Jm\\ = 0-25 is our fiducial value for the scatter in mass at fixed 
richness. This value my be somewhat overestimated at the high mass end. Note, however, that the amplitude A of this 
relation can shift by w 0.1 depending on the value of the correlation coefficient r, the choice of fiducial cosmology, etc. 
Adopting a 20% systematic uncertainty the overall amplitude, and adding in quadrature to the expected statistical 
uncertainty <Jm\\ = 0.25, we find that the total uncertainty in the mass of any given cluster is 0.33 at the la level. 

We can also compute the corresp onding mass-richness re lations for other mass definitions by rescaling all assigned 
cluster masses using the formulas in lHu fc Kravtsovj (|2003l ). and refitting to a power-law. We find 

K;^^)= L14+L041 " (A/60) - (B6) 

We emphasize again, however, that these are not meant to be a rigorous mass calibration, a problem that we defer to 
future work. 

As a check of our mass calibration, we have assembled two cluster samples that overlap the maxBCG footprint 
and redshift rang e. The first sample i s drawn from the LoCuSS ob servations of high luminosity clusters from RASS 
cluster catalogs (jEbeling et al.l 119981 [2000: Boh ringer et al.l l2004j) . These cl usters have been observed with both 
XMM/Newton to obtain hydrostatic mass estimates (Zha ng et al.l 120081 I2010D, with S ubaru wealk lensing follow-up 
to provide independent estim ates of the masses of sev eral clusters (jZhang et al.l I2010T ) . Here we conc entrate on the 
hydro static mass estimates of IZhang et "all (|2008L l2010T >. The ROSAT Brightest Clus ter Sample (BCS: labeling et all 
Il998h has been observed with Chandra, with hydrostatic mass estimates obtained by iMantz et al.l (|2010l ) . The X-ray 
mass values for the BCS clusters have been reduced by 11% to account for the Chandra calibration update described in 
that paper (Mantz, A., private comm.). For each cluster, we have used the X-ray center and spectroscopic redshift to 
estimate A from SDSS DR7 photometric data. We emphasize that we cannot use this data to provide a rigorous mass 
calibration due to the fact that these cluster samples do not constitute a random sampling of the maxBCG clusters. 
We only use this data for illustrative purposes, and to test whether the mass calibration derived from density matching 
is reasonable. 

Figure IbToI (right panel) shows the mass as a function of A obtained via density matching, scaled to M^ c (Eqn. |B6|) . 
Dotted lines are the expected 90% confidence interval assuming an uncertainty AlnAf = 0.33 as discussed above (25% 
intrinsic scatter, 20% systematic uncertainty). The LoCuSS sample is overplotted with red triangles, and the BCS 
clusters are shown with blue squares. It is clear that there is a normalization offset between the two data sets, which 
corresponds to a systematic offset AlnM = 0.45 in mass. A full accounting of this offset is beyond the scope of 
this paper, but there are several possibilities. First, the analyses were done with different data sets (XMM and 
Chandra); second, there is an additional 10% systematic uncertainty in the normalization of the BCS clusters due 
to the uncertainty in / gas used to calculate the masses; third, because the masse s are calcula t ed in a scaled aperture 
( r 500c), any slight normalization offset is magnified in the full analysis; fourth, IZhang et al.l (|2008l ) note that if one 
changes their parametrization of the temperature profile below rj 0.5i?5oo so as to allow for a rapid drop in the cluster 
temperature, their hydrostatic masses can increase by as much as 25%. 

Regardless of what the ultimate source of the difference in the normalization between the two data sets is, it is 
reassuring to see that our rough mass calibration fits between these two data sets. Moreover, the scatter in the mass- 
richness relation is clearly smaller than the overall difference in normalization between the two sets, and consistent with 
our 25% estimate for the scatter. This demonstrates both that the richness estimator A is indeed tightly correlated 
with cluster mass with scatter at the ~ 20% — 30% level, and that our 33% estimate for the uncertainty in the mass of 
any one cluster is reasonable. Note that s ince the scatter in mass at fixed richness for the original maxBCG richness 
(A200) was estimated to be a\ n m\n — 

0.45 lRozo et"afl (|2009ah . this means that even this rough mass calibration allows 
us to predict individual cluster masses with significantly higher precision than we could using the maxBCG richness. 

THE AUGMENTED MAXBCG CLUSTER CATALOG 

Table IC2I contains the improved richness, A, for each cluster in the maxBCG cluster catalog. The positions (RA, 
Dec), photometric redshifts (z), and original richness estimate (A^oo) are taken directly from Table 1 in lKoester et al.1 
(|2007al ). The spectroscopic redshifts (zf^?), are obtained by cross-identifying the maxBCG BCG positions with the 
full spectroscopic catalog from SDSS DR8 23 , thus increasing the number of BCGs with spectra from 5413 to 9409. 

23 http:/ /www. sdss.org/dr8/spectro/ 
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Table C2. maxBCG Cluster Catalog With A Richness 



R.A. (dog) 


Decl. (deg) 


z 


BCG 
^"spcc 


^200 


A 


A e 


239.58334 


27.233419 


0.103 


0.091 


188 


199.82 


3.84 


140.10742 


30.494063 


0.292 


-1.000 


126 


181.24 


6.22 


198.77182 


51.817380 


0.286 


-1.000 


87 


172.22 


5.00 


126.37104 


47.133478 


0.135 


0.129 


99 


147.35 


3.72 


203.16008 


50.559919 


0.284 


-1.000 


114 


174.03 


5.71 


354.41554 


0.271383 


0.286 


0.277 


88 


129.53 


4.85 


213.78496 


-0.493247 


0.135 


0.139 


115 


77.89 


3.34 


189.24684 


63.186584 


0.294 


-1.000 


89 


114.17 


4.81 


216.48612 


37.816455 


0.167 


0.170 


98 


118.92 


3.62 


187.70363 


10.546381 


0.167 


0.170 


70 


120.36 


4.07 



Note. — A full electronic version of Table |C21 is fits format is available at http://kipac.stanford.edu/maxbcg/ A portion is shown 
here for guidance regarding its form and content. 



Finally, the improved richness and error estimates (A, A e ) are taken from this work. Note that the catalog is complete 
in A^oO) but is not complete in A. We set zfs£? = —1.0 when a spectrum is not available, and set A = —1.0 when 
no significant number of red galaxies above background is found by the richness estimator. In addition, we note that 
the DR7 galaxy catalog used to calculate A covers a slightly different footprint than the original DR4 catalog used 
to construct the original maxBCG catalog. There are 80 maxBCG clusters that we cannot calculate reliable richness 
estimates as they are in newly masked regions. For these, we set A = —2.0 in the catalog. 



